Robust Neuro-Fuzzy Speaker Localization Using a Circular Microphone Array
Abstract
A major application area of microphone array processing is the localization of sound sources, primarily human speakers. In contrast to most state-of-the-art approaches, which are based on correlation measures, we propose a neurologically inspired system that generalizes findings about human spatial hearing to the multi-channel case. It mimics the processing in the human cochlea and the auditory midbrain. To enhance the localization quality, a new spike generation approach is introduced, termed peak-over-average position (PoAP). A fuzzy combination is used to remove putative artifacts. In contrast to a human listener, we employ multiple sensors to gain robustness in reverberant and noisy environments. Post-processing estimates the locations of concurrent speakers. The robustness of the proposed system is shown by comparison with the well-known steered response power approach. Finally, we show the applicability of our real-time neuro-fuzzy model to the concurrent speaker localization task using real reverberant recordings.
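For context on the correlation-based family that the abstract contrasts itself with, the following is a minimal sketch of a generalized cross-correlation with phase transform (GCC-PHAT) delay estimate between two microphone signals. It is not the neuro-fuzzy method proposed in the paper, and it is not taken from the paper. The function name `gcc_phat_delay` and the parameters `fs` and `max_delay` are assumptions chosen for illustration.

```python
import numpy as np

def gcc_phat_delay(x, y, fs, max_delay=None):
    """Estimate the delay of y relative to x, in seconds, using GCC-PHAT.

    Illustrative sketch only: names and defaults are assumptions,
    not part of the paper.
    """
    # Zero-pad so the circular correlation behaves like a linear one.
    n = len(x) + len(y)
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)

    # Cross-power spectrum, then PHAT weighting (keep the phase only).
    cross = Y * np.conj(X)
    cross /= np.abs(cross) + 1e-12
    cc = np.fft.irfft(cross, n=n)

    # Limit the search range to physically plausible lags if requested.
    max_shift = n // 2
    if max_delay is not None:
        max_shift = max(1, min(int(round(max_delay * fs)), max_shift))

    # Reorder so that negative lags come first and lag 0 sits at index max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    lag = int(np.argmax(np.abs(cc))) - max_shift
    return lag / fs


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fs = 16000                       # assumed sampling rate in Hz
    x = rng.standard_normal(1024)    # synthetic source signal
    d = 5                            # true delay in samples
    y = np.concatenate((np.zeros(d), x[:-d]))
    print(gcc_phat_delay(x, y, fs) * fs)
```

A steered response power scheme, in broad terms, accumulates such pairwise correlations over candidate source positions. The sketch covers only the single-pair building block.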
Similar resources
Reverberation-Robust One-Bit TDOA Based Moving Source Localization for Automatic Camera Steering
We address the problem of moving acoustic source localization and automatic camera steering using one-bit measurement of the time-difference of arrival (TDOA) between two microphones in a given array. Given that the camera has a finite field of view (FoV), an algorithm with a coarse estimate of the source location would suffice for the purpose. We use a microphone array and develop an algorithm...
Robust Speaker Localization Utilizing a Novel Beamforming Algorithm Based on Harmonic Structures
Speaker localization by microphone array has recently received significant attention. Although various methods have been proposed, their performance with short data segments under noise and reverberation degrades considerably. Sound localization based on Steered Response Power (SRP) shows more robustness in practical situations, especially with the use of short data segments. In SRP-PHAT algorit...
Robust speech recognition with speaker localization by a microphone array
This paper proposes robust speech recognition with Speaker Localization by an Arrayed Microphone (SLAM) to realize a hands-free speech interface in noisy environments. In order to localize a speaker direction accurately in low SNR conditions, a speaker localization algorithm based on extracting pitch harmonics is introduced. To evaluate the performance of the proposed system, speech recognition ...
Adaptive beamforming and soft missing data decoding for robust speech recognition in reverberant environments
This paper presents a novel approach to combine microphone array processing and robust speech recognition for reverberant multi-speaker environments. Spatial cues are extracted from a microphone array and automatically clustered to estimate localization masks in the time-frequency domain. The localization masks are then used to blindly design adaptive filters in order to enhance the source sign...
A Multi-Sensor Object Localization System
This paper presents a localization and tracking system integrating multiple sensors. Object localization results from local sensor systems are fused using a decentralized Kalman filter. An audiovisual speaker tracking system is evaluated, which is based upon a video based face tracker and a microphone array. A quantitative analysis shows that the presented bimodal tracking system can deliver mo...
Publication date: 2010